CDS
Accession Number | TCMCG075C24188 |
gbkey | CDS |
Protein Id | XP_007019116.2 |
Location | join(6683699..6683923,6685909..6685977,6686078..6686174,6686292..6686498,6687742..6687857,6687936..6688040,6688119..6688291,6688381..6688471,6688949..6689101,6689875..6690108) |
Gene | LOC18592364 |
GeneID | 18592364 |
Organism | Theobroma cacao |
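
The Location field above uses the GenBank `join(start..end,...)` notation, where each pair is a 1-based, inclusive exon segment. As a minimal sketch (the helper name `parse_join` is hypothetical, and it assumes a plain forward-strand `join` with no `complement(...)` wrapper or partial markers such as `<` or `>`), the segments can be parsed and summed to check that the spliced CDS length agrees with the protein length reported in this record:

```python
import re

def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs, 1-based inclusive, from a simple join(...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

# Location string copied from this record.
location = (
    "join(6683699..6683923,6685909..6685977,6686078..6686174,"
    "6686292..6686498,6687742..6687857,6687936..6688040,"
    "6688119..6688291,6688381..6688471,6688949..6689101,"
    "6689875..6690108)"
)

segments = parse_join(location)
# Both coordinates are inclusive, so each segment spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)
codons = cds_length // 3

# The final codon is the stop codon, so the protein is one residue shorter.
print(len(segments), cds_length, codons - 1)  # -> 10 1470 489
```

The 10 segments sum to 1470 nt, i.e. 490 codons including the stop codon, which matches the 489 aa protein length listed under Protein.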
Protein
Length | 489 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007019054.2 |
Definition | PREDICTED: aldehyde dehydrogenase family 3 member H1 isoform X2 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGCGAGAGAAGTGGAGAAGAAAGCGGTTTTCGATACGGACTCGGCCAAGGAGGTGGTGAAGGAGTTGAGAGCTAGCTTTGTTGCTGGAAAAACTAAAAGCTACGAATGGAGAGTTGCTCAGTTGAAAGCCTTGTTGAAGATGACTGAAGAGAACGAGCCGCAAATCGCCGCCGCCCTTCGCGACGATCTTTCCAAGCCGGAACTCGAATCCTACATCTACGAGATAGCAATGTTGAAGAGCTCATGTAGATTGGCACTCAAGGAAATGAAGCGTTGGATAATGCCAGAAAGGGCAAAAACTTCGTTGACTACATTTCCTTCATCTGCTGAAATTGTATCTGAGCCATTGGGTGTTGTGCTAGTAATATCAGCATGGAATTATCCTTTTTTGTTGTCCCTTGATCCAGTTGTTGGAGCTATTGCAGCCGGTAATGCTGTAGTCTTAAAGCCATCAGAAATTGCTCCAGCCACAGCATCATTGCTTGCAAAGCTGGTAGCCAATTATTTGGATAGCTCTTGCATAAAGGTTGTTGAAGGGGCTGTTTCTGAAACATCAGCACTTCTGGAGCAGAAGTGGGACAAAATATTTTATACAGGCAATGGAAGAGTTGCACGCATTGTGATGGCAGCTGCTGCAAAGCACCTAACACCTGTTGTTTTGGAGCTTGGAGGAAAATCTCCAGTCATTGTTGATTCAGGCATCAATTTACAGGTTGCAACGAGGCGGATTATTGCGGGCAAGTGGGGGTGTAATAATGGACAAGCATGTATTTCTCCTGACTACATTATTACAACAAAAGATTATGCTCCAAAGTTGCTAGATTCTTTCAAACGTGAATTGGAGCAGTTTTATGGAAAGAATCCGCTGGAGTCAAAAGACTTATCTCGCATAGTGAATTCGAACCACTTTGCTCGCTTGTCAAAGCTCTTGGATGAGGACAAGGTGTCTGGTAAAATCGTCCATGGAGGTGAAAGAGACAAAAACAACTTGAAGATTGCTCCCACTATCTTGCTTGATGTCCCACTAGATTCTCTGATCATGAATGAAGAGATATTTGGTCCATTGCTTCCAATTATCATGGTTGACAAAGTGGAAGACAGCTTTGATGTGATAAATTCTTCAGGAACAAAGCCATTAGCAGCATATCTGTTTACCAATAAGGAGAAGCTGAAAGAGAAGTTTGTTGCGACAGTCTCTGCAGGGGGTTTGGTTGTCAATGACACGACTGTACATCTTGCTGAACACACTTTACCATTTGGAGGAGTCGGGGACAGCGGAATGGGTGCATACCATGGGAAATTCTCCTTTGATGCTTTTAGCCATAAGAAGGCTGTTCTTTATAGAGGTTTTGCTTGTGATGCATTTGTGAGATACCCACCATACACAAGGCGAAAGCTAAGATTGTTGCAGGCTCTTCTTGGTGGTAGTTTATTAAGCATAATCCGAGCATTGCTGGGATGGTCTTAG |
Protein: MAREVEKKAVFDTDSAKEVVKELRASFVAGKTKSYEWRVAQLKALLKMTEENEPQIAAALRDDLSKPELESYIYEIAMLKSSCRLALKEMKRWIMPERAKTSLTTFPSSAEIVSEPLGVVLVISAWNYPFLLSLDPVVGAIAAGNAVVLKPSEIAPATASLLAKLVANYLDSSCIKVVEGAVSETSALLEQKWDKIFYTGNGRVARIVMAAAAKHLTPVVLELGGKSPVIVDSGINLQVATRRIIAGKWGCNNGQACISPDYIITTKDYAPKLLDSFKRELEQFYGKNPLESKDLSRIVNSNHFARLSKLLDEDKVSGKIVHGGERDKNNLKIAPTILLDVPLDSLIMNEEIFGPLLPIIMVDKVEDSFDVINSSGTKPLAAYLFTNKEKLKEKFVATVSAGGLVVNDTTVHLAEHTLPFGGVGDSGMGAYHGKFSFDAFSHKKAVLYRGFACDAFVRYPPYTRRKLRLLQALLGGSLLSIIRALLGWS |
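
The relationship between the CDS and Protein sequences above can be sketched with a small translation routine. This is an illustration only: it assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and the function name `translate` is hypothetical. Only the first eight codons and the last three codons of the CDS are used here to keep the example short.

```python
# Standard genetic code (assumed: NCBI translation table 1), laid out in
# T, C, A, G order for the first, second, and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 24 nt and last 9 nt of the CDS in this record.
cds_start = "ATGGCGAGAGAAGTGGAGAAGAAA"
cds_end = "TGGTCTTAG"

print(translate(cds_start))  # -> MAREVEKK
print(translate(cds_end))    # -> WS*
```

The translated start `MAREVEKK` and end `WS` followed by a stop codon agree with the first and last residues of the Protein sequence listed above.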